Characterizing one-class datasets

نویسندگان

  • David M.J. Tax
  • Robert P.W. Duin
چکیده

This paper aims at characterizing classification problems to find the main features that determine the differences in performance by different classifiers. It is known that, using the disagreements between the classifiers, a distance measure between datasets can be defined. The datasets can then be embedded and visualized in a 2-D scatterplot. This embedding thus reveals the structure of the set of problems. In this paper we focus on a specific pattern recognition problem, the problem of outlier detection or one-class classification, where classifiers have to detect if a new object resembles the training data or not. For this problem the outputs of many classifiers on many datasets are available. By inspecting the scatterplot of the datasets, two main features appear to characteristize the datasets; (1) their effective sample size and (2) the class overlap. By generating artificial datasets for which these variables are varied, these observations are confirmed experimentally.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Characterizing sub-topical functions

In this paper, we first give a characterization of sub-topical functions with respect to their lower level sets and epigraph. Next, by using two different classes of elementary functions, we present a characterization of sub-topical functions with respect to their polar functions, and investigate the relation between polar functions and support sets of this class of functions. Finally, we obtai...

متن کامل

ارائه یک روش فازی-تکاملی برای تشخیص خطاهای نرم‌افزار

Software defects detection is one of the most important challenges of software development and it is the most prohibitive process in software development. The early detection of fault-prone modules helps software project managers to allocate the limited cost, time, and effort of developers for testing the defect-prone modules more intensively.  In this paper, according to the importance of soft...

متن کامل

(c,1,...,1) Polynilpotent Multiplier of some Nilpotent Products of Groups

In this paper we determine the structure of (c,1,...,1) polynilpotent multiplier of certain class of groups. The method is based on the characterizing an explicit structure for the Baer invariant of a free nilpotent group with respect to the variety of polynilpotent groups of class row (c,1,...,1).

متن کامل

Mammalian Eye Gene Expression Using Support Vector Regression to Evaluate a Strategy for Detecting Human Eye Disease

Background and purpose: Machine learning is a class of modern and strong tools that can solve many important problems that nowadays humans may be faced with. Support vector regression (SVR) is a way to build a regression model which is an incredible member of the machine learning family. SVR has been proven to be an effective tool in real-value function estimation. As a supervised-learning appr...

متن کامل

Detection of Abnormal Events via Optical Flow Feature Analysis

In this paper, a novel algorithm is proposed to detect abnormal events in video streams. The algorithm is based on the histogram of the optical flow orientation descriptor and the classification method. The details of the histogram of the optical flow orientation descriptor are illustrated for describing movement information of the global video frame or foreground frame. By combining one-class ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006